Weinong Wang

ChartArena: Benchmarking Chart Parsing across Languages, Scenarios, and Formats

May 31, 2026

PhoneWorld: Scaling Phone-Use Agent Environments

May 28, 2026

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

May 05, 2026

CF-VLA: Efficient Coarse-to-Fine Action Generation for Vision-Language-Action Policies

Apr 28, 2026

Ego-InBetween: Generating Object State Transitions in Ego-Centric Videos

Apr 20, 2026

Enhancing MLLM Spatial Understanding via Active 3D Scene Exploration for Multi-Perspective Reasoning

Apr 08, 2026

VideoTIR: Accurate Understanding for Long Videos with Efficient Tool-Integrated Reasoning

Mar 26, 2026

MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation

Mar 25, 2026

Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training

Mar 25, 2026

LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs

Sep 19, 2025